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(54) Motion vector detection apparatus 

(57) In a motion vector detection apparatus, cas- 
cade-connected processor elements each calculates 
the absolute value of the difference between each of a 
plurality of pixels which compose a picture and a corre- 
sponding one of the same number of pixels included in 
a block, and also performs cumulative addition of the 
difference absolute values in the block. These opera- 
tions are performed for each of the blocks within the pre- 
determined search area. A comparative means 
repeatedly compares the cumulative addition values of 
two blocks obtained sequentially in the processor ele- 
ment at the f inal stage, and selects the smaller one of 
the cumulative addition values. A subtracter compares 
the smaller cumulative addition value with a setting 
value. When the smaller cumulative addition value is 
smaller than the setting value, a control circuit halts the 
supply of clock signals to the processor elements and 
the comparison means, so as to halt the entire opera- 
tions of the apparatus. Consequently, the power con- 
sumption can be reduced. 
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Description 

BACKGROUND OF THE INVENTION 

The present invention relates to an improvement of 
a motion vector detection apparatus which detects 
motion vectors used for the prediction in motion com- 
pensation prediction, which is one of the motion picture 
compression techniques. 

In transmitting and storing motion picture having a 
targe amount of data, the motion picture compression 
techniques are inevitable to reduce the large amount of 
data. The motion picture compression techniques 
include a compression technique called motion com- 
pensation prediction. According to the motion compen- 
sation prediction, the level of data redundancy in motion 
picture is lowered in the direction of the time axis by 
extracting the displacement between two highly-corre- 
lated pictures, so as to compress the amount of data. 
The displacement between the highly-correlated pic- 
tures in the motion compensation is referred to as a 
motion vector, which is generally detected by a block 
matching method. The block matching method is briefly 
described with reference to Fig. 5. 

The block matching method is a method of evaluat- 
ing the correlation between two pictures in predeter- 
mined block units. In Fig. 5, when a motion vector for a 
block (previous picture) 502 on Picture A 501 is 
detected, an evaluation value is calculated for each one 
of a plurality of blocks within a predetermined search 
area 504 on Picture B 503. As a result of the calculation, 
when block X 505 shown in Fig. 5 has turned out to have 
the best evaluation value (the highest correlation), the 
displacement from the position on Picture B 503 on 
which the previous picture block 502 is projected to 
block X 505 is detected as a motion vector 506. 

As the evaluation value indicative of the degree of 
correlation, a total sum is generally used which is 
obtained by cumulatively adding the absolute value of 
the difference between each of the pixels included in a 
block and a spatially corresponding one of the same 
number of pixels included in another block. As the eval- 
uation value is smaller, the two blocks have a smaller 
difference, that is, higher correlation. 

As a conventional motion vector circuit for detecting 
motion vectors by the above-mentioned block matching 
method, there is a well-known technique disclosed in 
Japanese Laid-Open Patent Application No. 7-184210. 

Fig. 1 shows the entire structure of a motion vector 
detection apparatus, where one block includes 256 pix- 
els, and each pixel has an 8-bit value. The apparatus 
shown in Fig. 1 includes cascade-connected processor 
elements (hereinafter PEs) 601 which are as many as 
the pixels included in one block: 256, a bus R 602 for 
transferring the pixel data of one block within the search 
area 504, a bus S 603 for transferring the pixel data of 
the previous picture block 502, a clock line 604 for sup- 
plying each of the PEs 601 with an operation clock, and 



a comparison means 605 for receiving an output of the 
PE at the final stage and comparing the previous value 
and the current value of the output. 

Fig. 9 show the conventional internal structure of 
5 each of the PEs 601 in the above-mentioned motion 
vector detection apparatus. The apparatus shown in 
Fig. 9 includes a difference absolute value calculator 

610 for calculating the absolute value of the difference 
of the pixel data to be transferred via the bus R 602 and 

10 the bus S 603, an adder 61 1 for adding an output value 
of the difference absolute value calculator 610 and an 
output value of the PE at the previous stage together 
and for outputting the addition results to the PE at the 
following stage, a register 612 for storing the pixel data 

75 of the previous picture block to be transferred via the 
bus S 603, a pipeline register 613 for pipeline process- 
ing the calculations of the difference absolute value cal- 
culator 610 and the adder 61 1 , and a pipeline register 
614 for pipeline processing the calculation of the adder 

20 611 of each of the PEs 601. 

The operations of the above-mentioned motion vec- 
tor detection apparatus will be described as follows. The 
leading pixel data in the previous picture block 502 are 
stored in the register 612 and then transferred to the dif- 

25 ference absolute value calculator 610 in the PE at the 
first stage (PE0). Also, the leading pixel data of a block 
(for example, block X 505) within the search area 504 
are transferred to the difference absolute value calcula- 
tor 610 in the PE at the first stage. The difference abso- 

30 lute value calculator 610 calculates the difference 
absolute value between the received two pixel data, and 
transfers the calculation results to the pipeline register 
613. The adder 611 receives the difference absolute 
value from the register 613 and transfers it to the pipe- 

35 line register 61 4 at the following stage, so that the differ- 
ence absolute value is transferred to the PE at the next 
stage (PE1). 

Then, the second pixel data following the above- 
mentioned leading pixel data in the previous picture 

40 block 502 are transferred to the PE at the second stage 
(PE1). Also, the next pixel data in the block X 505 within 
the search area 504 are transferred to the PE at the 
second stage. In the PE at the second stage, the differ- 
ence absolute value between these two pixel data is cal- 

45 culated and the results are stored in the pipeline register 

613 in the same manner as in the PE at the first stage. 
Furthermore, in the PE at the second stage, the adder 

61 1 adds the difference absolute value of the leading 
pixel data transferred from the PE at the first stage and 

so the difference absolute value of the second pixel data 
together. The results are stored in the pipeline register 

614 and then transferred to the PE at the third stage 
(PE2). 

These operations are repeated hereafter until the 
55 total sum of the difference absolute values between all 
the pixels in the previous picture block 502 and in block 
X 505 within the search area 504 is obtained in the PE 
at the final stage (PE255). 
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These operations are sequentially performed for 
every block within the search area 504 although Fig. 5 
shows only one other block Y 507. 

The comparison means 605 receives the total sum 
of the difference absolute values outputted from the PE 
at the final stage, compares the total sum in the previ- 
ous block (for example, block X 505) and the total sum 
in the current block (for example, block Y507), and 
chooses the smaller of the two total sums. When the 
total sums of the difference absolute values in all the 
blocks within the search area 504 have been calculated 
and compared, the comparison means 605 obtains the 
smallest total sum, and outputs, as a motion vector, the 
displacement between the block having the smallest 
total sum and the previous picture block 502. 

However, the above-mentioned conventional 
motion vector detection apparatus has drawbacks of 
demanding huge power consumption because of a 
large amount of calculation processing needed and of 
requiring large circuit scales. These drawbacks will be 
detailed as follows. 

In Fig. 5, Picture A 501 is one of the 30 pictures dis- 
played in one second and its size is 720 pixels x 480 
lines in the case of standard TV motion picture. Since 
the standard size of the previous picture block 502 is 16 
pixels x 16 lines in the motion vector detection used for 
the motion compensation prediction of the motion pic- 
ture compression, the number of blocks included within 
the search area 504 amounts to 1024. Under these con- 
ditions, the number of blocks subject to the block match- 
ing amounts to 40,500 (=720 x 480 x 30 - 16 - 16) per 
second. A block matching performed for each block 
requires 262,144 (=1,024 x 16 x 16) times of calcula- 
tions for the difference absolute values and the total 
sums. Consequently, the motion vector detection by the 
block matching method requires as many as 
10,616,832,000 (=40, 500 x 262,144) times of calcu- 
lates per second for the difference absolute values and 
the total sums, which demands huge power consump- 
tion. 

I n order to perform the calculations of the difference 
absolute values and the addition of these difference 
absolute values (total sum) with a high precision, it is 
necessary to make the difference absolute value calcu- 
lator, the adder, the pipeline register, and other units in 
each PE have large bit widths. It is also inevitable to pro- 
vide as many as 256 PEs each including these units, 
which leads to undesirable expansion of the circuit 
scales. 

SUMMARY OF THE INVENTION 



The object of the present invention is to provide a 
motion vector detection apparatus which has reduced 
circuit scales and detects motion vectors with low power 
consumption. 

In order to achieve the object, the present invention 
has the following features. In the motion compensation 



prediction, when a motion vector is a displacement 
between the previous picture block and a block which 
has sufficiently high correlation with the previous picture 
block, even if it is not the block having the highest corre- 
5 lation, the level of data redundancy between the pic- 
tures can be lowered sufficiently and motion pictures 
can be compressed at a high compression rate. 

These features bring about the following effects. 
Firstly, in calculating the evaluation values of a plurality 
10 of blocks within a search area, if it is possible to obtain 
an evaluation value which is not the best, but is as good 
as the best evaluation value, in other words, if the eval- 
uation value does not need to have a precision (resolu- 
tion) higher than indicating the level of correlation, the 
75 bit width of the adder and other units in each PE can be 
reduced, and accordingly both the circuit scales and the 
power consumption can be reduced. 

Secondly, in calculating the total sum of the differ- 
ence absolute values of pixels with the use of a plurality 
20 of PEs, a predetermined setting value (threshold value) 
can be set as the addition results of the adder in each 
PE, and when calculated addition results exceed the 
setting value, that is, when low correlation can be pre- 
dicted before the total sum is obtained, the useless 
25 addition processing of the difference absolute values in 
the remaining PEs can be halted. As a result the power 
consumption can be reduced. 

Thirdly, when a block having sufficiently high corre- 
lation with the previous picture block is detected, it is 
30 unnecessary to detect a block having higher correlation. 
Therefore, the calculation of the evaluation values here- 
after can be hatted, which accordingly decreases the 
nurrfcer of calculations, and reduces the consumption 
power. 

35 In the conventional motion vector detection appara- 
tus, the adder 611 on each of PE0-PE255 has a bit 
width which can express the largest total sum of the dif- 
ference absolute values obtained in the PE at the final 
stage. However, the adder 611 adds the difference 
40 absolute value calculated in the PE at the present stage 
to the difference absolute value obtained by the adder 
61 1 in the PE at the previous stage, that is, the total sum 
of the difference absolute values calculated so far. 
Therefore, the adder 61 1 in a PE at a later stage calcu- 
45 lates a larger total sum, while the adder 61 1 in a PE at 
an earlier stage can manage with a smaller bit width. As 
a result, it is possible to make the adder 61 1 in a PE at 
an earlier stage have a smaller bit width, so as to reduce 
the circuit scale. 
so In view of these aspects, the motion vector detec- 
tion apparatus of the present invention for detecting cor- 
relation between a picture consisting of a plurality of 
pixels and a plurality of blocks each consisting of the 
same number of pixels as the picture, the plurality of 
55 blocks being included within a predetermined search 
area containing a larger number of pixels than the pic- 
ture is characterized by comprising: a plurality of PEs 
which are cascade-connected, each of the PEs calcu- 
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lating a difference absolute value between each of the 
plurality of pixels included in the picture and a corre- 
sponding one of pixels included in every one of the plu- 
rality of blocks, and performing cumulative addition of 
the difference absolute values for every one of the plu- 5 
rality of blocks sequentially; a comparison means for 
repeatedly comparing cumulative addition values of the 
difference absolute values of two of the plurality of 
blocks by subtraction, the cumulative addition values 
being obtained in the PE at a final stage, and for repeat- w 
edly selecting the smaller cumulative addition value; 
and a restriction means for placing restrictions on calcu- 
lation of at least one of the comparison means and the 
plurality of PEs, so as to give a cumulative addition 
value finally obtained in the comparison means a possi- 15 
bility of becoming a sufficiently small cumulative addi- 
tion value even if the cumulative addition value is not 
smallest. 

The above-mentioned motion vector detection 
apparatus of the present invention is also characterized 20 
in that each of the plurality of PEs comprises: a differ- 
ence absolute value calculation means for calculating 
the difference between each of the plurality pixels 
included in the picture and a corresponding one of pix- 
els contained in one of the plurality of blocks, and fur- 25 
ther calculating the absolute value of the difference, so 
as to obtain a difference absolute value having a prede- 
termined number of bits; an addition means for adding 
the difference absolute value obtained by the difference 
absolute value calculation means and an output of the 30 
PE at the previous stage together, and for transferring 
addition results to the PE at the next stage; and the 
restriction means being composed of a processing 
means for truncating or rounding predetermined low- 
order bits of the difference absolute value obtained by 35 
the difference absolute value calculation means, so as 
to obtain the difference absolute value having high- 
order bits. 

The above-mentioned motion vector detection 
apparatus of the present invention is also characterized 40 
in that each of the plurality of PEs comprises: a differ- 
ence absolute value calculation means for calculating 
the difference between each of the plurality pixels 
included in the picture and a corresponding one of pix- 
els contained in one of the plurality of blocks, and fur- 45 
ther calculating the absolute value of the difference, so 
as to obtain a difference absolute value; an addition 
means for adding the difference absolute value 
obtained by the difference absolute value calculation 
means and an output of the PE at the previous stage so 
together, and for transferring addition results to the PE 
at the next stage, the addition means being composed 
of a calculator whose bit width is small; and the restric- 
tion means comprising the calculator which composes 
the addition means and a forced halt means for forcibly ss 
halting the cumulative addition of the difference abso- 
lute values in the PEs at the following stages when a 
value calculated by the calculator is larger than the 



maximum value which can be expressed with the bit 
width. 

The above-mentioned motion vector detection 
apparatus of the present invention is also characterized 
in that the calculator whose bit width is small is com- 
posed of a calculator which outputs an overflow signal 
when a calculated value is larger than the maximum 
value which can be expressed with the bit width; and the 
forcible halt means receives the overflow signal from the 
calculator in the PE at the previous stage, ignores addi- 
tion results of the addition means, and transfers the 
overflow signal in each of the PEs at the current stage 
and the previous stage to the PE at the next stage. 

The above-mentioned motion vector detection 
apparatus of the present invention is also characterized 
in that the restriction means compares a setting value 
with the smaller cumulative addition value selected by 
the comparison means, and halts the entire operations 
of the motion vector detection apparatus when the 
smaller cumulative addition value is smaller than the 
setting value. 

The above-mentioned motion vector detection 
apparatus of the present invention is also characterized 
in that the restriction means comprises: a signal gener- 
ation means for generating a halt signal when the 
smaller cumulative addition value is smaller than the 
setting value; and a control means for halting the supply 
of a clock signal to each of the PEs and the comparison 
means when the signal generation means generates 
the halt signal. 

The above-mentioned motion vector detection 
apparatus of the present invention is also characterized 
by further comprising: a plurality of registers for storing 
data, which are provided in each of the plurality of PEs; 
a data transfer control circuit provided on a data bus for 
transmitting data of the plurality of pixels composing one 
block to each of the plurality of PEs; and the restriction 
means dividing each bit string of the plurality of registers 
and the data transfer control circuit into a plurality of 
contiguous bit strings and controlling data writing or 
data transferring of each of the plurality of contiguous bit 
strings independently. 

The above-mentioned motion vector detection 
apparatus of the present invention is also characterized 
in that the restriction means generates a switch signal, 
so as to halt the supply of a clock signal to predeter- 
mined low-order bits of each bit string of the plurality of 
registers and the data transfer control circuit. 

Another motion vector detection apparatus of the 
present invention for detecting correlation between a 
picture consisting of a plurality of pixels and a plurality of 
blocks each consisting of the same number of pixels as 
the picture, the plurality of blocks being included within 
a predetermined search area containing a larger 
number of pixels than the picture is characterized by 
comprising: a plurality of PEs which are cascade-con- 
nected, each of the plurality of PEs calculating the dif- 
ference absolute value between each of the plurality of 
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pixels included in the picture and a corresponding one 
of pixels included in every one ol the plurality of blocks, 
and performing cumulative addition of the difference 
absolute values for every one of the plurality of blocks 
sequentially, each of the plurality of PEs comprising: a 5 
difference absolute value calculation means for calcu- 
lating the difference between each of the plurality pixels 
included in the picture and a corresponding one of pix- 
els contained in one of the plurality of blocks, and fur- 
ther calculating the absolute value of the difference, so w 
as to obtain a difference absolute value; an addition 
means for adding the difference absolute value 
obtained by the difference absolute value calculation 
means and an output of the PE at the previous stage 
together, and for transferring addition results to the PE 75 
at the next stage, and the addition means provided in 
the PE at the later stage of the plurality of PEs having a 
larger bit width. 

The above-mentioned motion vector detection 
apparatus of the present invention is also characterized 20 
in that the addition means provided in each of the plural- 
ity of PEs has the smallest bit width that does not cause 
an overflow. 

According to the above-mentioned construction of 
the present invention, the restriction means puts restric- 25 
tions on at least one of the calculation in the plurality of 
PEs and the subtraction processing in the comparison 
means, so that both the power consumption and the cir- 
cuit scales can be reduced. In that case, when the 
cumulative addition value of the difference absolute val- 30 
ues finally obtained in the comparison means is a value 
close to the smallest value, the block having the close 
value can be used as a block having sufficient correla- 
tion in detecting a motion vector so as to lower the level 
of the data redundancy between pictures sufficiently, 35 
and to compress the motion pictures at a high compres- 
sion rate. 

Since the addition means in each PE has a small bit 
width, the circuit scale can be small, which accordingly 
reduces the power consumption. 40 

When the addition results of the adder exceed the 
setting value in a PE, or when low correlation is pre- 
dicted before the total value is finally obtained by contin- 
uing the remaining addition, the useless addition 
processing of the difference absolute values in the 4t 
remaining PEs is halted, which reduces the power con- 
sumption. 

Furthermore, when a block having sufficiently high 
correlation with the previous picture block is detected, a 
motion vector is detected based on this block, which so 
makes it unnecessary to detect a block having higher 
correlation. Consequently, the calculation of the cumu- 
lative addition value of the difference absolute values in 
the remaining blocks is halted, so that the number of 
calculations is effectively decreased, which reduces the ss 
consumption power. 

In addition, the operations of the registers and the 
low-order bits of the data transfer control circuit are for- 



cibly halted by means of a switching signal. This enable 
the choice between obtaining a desired precision with 
all the bits and obtaining a reduced precision with the 
high-order bits only. The latter processing can reduce 
the power consumption. 

The bit width of the addition means is set larger in a 
PE at a later stage. Consequently, the circuit scale can 
be efficiently reduced, and accordingly, the power con- 
sumption is reduced, as compared with the conven- 
tional apparatus where the bit width of the addition 
means at the final stage is adopted in the addition 
means at all the other stages. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Rg. 1 shows the entire structure of the motion vec- 
tor detection apparatus of the first embodiment of the 
present invention. 

Fig. 2 shows the structure of a PE which is a com- 
ponent of the motion vector detection apparatus of the 
same. 

Rg. 3 shows the structure of a PE of the motion 
vector detection apparatus of the second embodiment 
of the present invention. 

Rg. 4 (a) shows the structure of the motion vector 
detection apparatus of the third embodiment of the 
present invention, and Rg. 4 (b) shows the internal 
structure of the comparison means which is a compo- 
nent of the motion vector detection apparatus of the 
same. 

Rg. 5 shows a diagram which illustrates the block 
matching method. 

Rg. 6 shows the internal structure of a PE of the 
motion vector detection apparatus of the fourth embod- 
iment of the present invention. 

Rg. 7 shows the control of the clock signal in the 
motion vector detection apparatus of the same. 

Rg. 8 (a) shows the entire structure of the motion 
vector detection apparatus of the fifth embodiment of 
the present invention, and Fig. 8(b) shows a graph indi- 
cating the required number of bits in the adder in each 
PE. 

Rg. 9 shows the structure of a PE which is a com- 
ponent of a conventional motion vector detection appa- 
ratus. 

DETAILED DESCRIPTION OF THE INVENTION 

Each preferred embodiment of the present inven- 
tion will be described as follows with reference to the 
attached drawings. 

(EMBODIMENT 1) 

Rgs. 1 and 2 show the motion vector detection 
apparatus of a first embodiment of the present inven- 
tion. Since the entire structure of the apparatus shown 
in Rg. 1 has been described above, the following 



9 



EP0 854 439 A2 



10 



description will be focused on the internal structure of a 
PE which is a component of the apparatus with refer- 
ence to Fig. 2. 

The PE shown in Fig. 2 includes a difference abso- 
lute value calculator 110 for calculating the absolute s 
value of a difference value so as to output an 8-bit differ- 
ence absolute value, an adder (addition means) 1 1 1 , a 
register 112 which is connected with B terminal of the 
difference absolute value calculator 110 and receives 
the pixel data of the previous picture block 502 shown in 10 
Fig. 5, a pipeline register 113 which is disposed on the 
side of the output of the difference absolute value calcu- 
lator 1 10, and a pipeline register 1 14 which is disposed 
on the side of the output of the adder 111. 

In each of the PEs 601 of the present embodiment, is 
only the high-order 4 bits of an 8-bit difference absolute 
value are entered to the adder 1 1 1 via the pipeline reg- 
ister 1 13. Therefore, the adder 1 1 1 and the two pipeline 
registers 113 and 114 have 4-bit widths. 

Of a difference absolute value outputted from the 20 
difference absolute value calculator 110, the low-order 4 
bits are truncated and only the high-order 4 bits are 
entered to the adder 111, and as a result, a processing 
means 10 outputs a high-order 4-bit difference calcula- 
tion value. The processing means 10 composes a 25 
restriction means for putting restrictions on the calcula- 
tion in the adder 1 1 1 in a PE. 

The operations of the motion vector detection appa- 
ratus of the present embodiment will be described as 
follows. The absolute value of a difference between a 30 
pixel in the previous picture block 502 stored in the reg- 
ister 112 and a pixel in block X 505 which spatially cor- 
responds to the pixel is calculated in the difference 
absolute value calculator 1 10, and the high-order 4 bits 
of the calculated difference absolute value are stored in 35 
the pipeline register 1 13. The difference absolute value 
stored in the pipeline register 113 and an output of the 
adder 1 1 1 (the sum of the difference absolute values) in 
the PE at the previous stage are added together by the 
adder 1 1 1 in the PE at the current stage. The addition 40 
results are then stored in the pipeline register 114. 

in each PE, the low-order 4 bits of a difference 
absolute value obtained in the difference absolute value 
calculator 1 1 0 are truncated, so that a value obtained as 
a result of dividing the difference absolute value of the 45 
difference absolute value calculator 1 10 by 16 is stored 
in the pipeline register 113 while the high-order 4-bit dif- 
ference absolute value is given to the adder 111. Con- 
sequently, in the motion vector detection apparatus of 
the present embodiment, the total sum of the values so 
each obtained as a result of dividing the difference 
absolute value by 16 is outputted to the PE at the final 
stage. As a result, the total sum has a lower precision 
(resolution) than its equivalent in the conventional appa- 
ratus, but is sufficiently indicative of the level of correla- ss 
tion, so that a motion vector obtained based on this total 
sum can be used for the motion compensation predic- 
tion to obtain a sufficiently high compression rate. 



Another advantage of the present embodiment is 
that the adder 111 and the pipeline registers 113 and 
114 have as small as a 4-bit width, as compared with 
the 8-bit width in the conventional apparatus, so that the 
circuit scale can be small. Furthermore, the power con- 
sumption is reduced according to the reduction of the 
circuit scale. 

To be more specific, the circuit scales of the pipe- 
line register 1 1 3, the adder 1 1 1 . and the pipeline regis- 
ter 114 in the present embodiment can be 50%. 75%, 
and 75%, respectively, of those of their equivalents in 
the conventional motion vector detection apparatus. 

Although the low-order 4 bits of an output of the dif- 
ference absolute value calculator 110 are truncated in 
the present embodiment it goes without saying that the 
reduction of the bit width can be also achieved by round- 
ing off to a predetermined bit of the output. 

(EMBODIMENT 2) 

A second embodiment of the present invention will 
be described based on Fig. 3. The description of the 
entire structure of the motion vector detection apparatus 
of the present embodiment will be omitted because the 
structure is the same as that of the first embodiment 
shown in Fig. 1 . 

Fig. 3 shows the internal structure of one of the 
PEs. The PE shown in Fig. 3 includes a difference abso- 
lute value calculator 210, an adder 21 1, a register 212, 
and pipeline registers 213 and 214. While the conven- 
tional adder 61 1 shown in Fig. 9 has a 16-bit width, the 
adder 21 1 of the present embodiment is composed of a 
calculator with a smaller bit width, such as a 10-bit 
width. 

The adder 21 1 outputs an overflow signal 215 when 
a calculated value is larger than the maximum value 
(threshold value) which can be expressed in the 10-bit 
width. An OR circuit 216 receives the overt low signal 
215 from the adder 21 1 and an output signal of the PE 
at the previous stage (not shown) via the signal path 
216a, and ORs these signals. The output of the OR cir- 
cuit 216 is transferred to the OR circuit in the PE at the 
next stage (not shown) via the pipeline register 217. A 
transition prohibition means 218 receives the overflow 
signal 21 5 from the adder 21 1 so as to prohibit the tran- 
sition of the data stored in the pipeline register 214 
which is at the stage following the adder 211. 

When the adder 211 in a PE outputs a value larger 
than the above-mentioned threshold value, the transi- 
tion prohibition means 218 in the PE at the next stage 
prohibits the transition of the data stored in the pipeline 
register 21 4 of the PE at the current stage, based on the 
overflow signal 215. As a result, a forced halt means 20 
is formed which forcibly halts the cumulative addition of 
the difference absolute values in the PE at and after the 
next stage by ignoring the addition results of the adder 
21 1 at the current stage, and which transfers overflow 
signals from the PEs at the current and previous stages 
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to the PE at the following stage via the OR circuit 216 
and the pipeline register 217. 

The following is a description of the operations of 
the motion vector detection apparatus of the present 
embodiment. The adder 2 1 1 outputs the overflow signal 
215 when its calculation results are equal to or larger 
than the threshold value, which is the maximum value 
that can be expressed in 10 bits. The OR circuit 216 
ORs the overflow signal 215 from the adder 21 1 and the 
overflow signal 215 from the PE at the previous stage, 
and transfers the results to the PE at the next stage via 
the pipeline register 217. In each PE, when the overflow 
signal 215 is transferred from the PE at the previous 
stage, the transition prohibition means 218 prohibits the 
transition of the data stored in the pipeline register 214 
by hatting the supply of a clock signal or by another 
method. As a result, when the calculated value is equal 
to or larger than the threshold value, the calculation of 
the total sum hereafter is halted, which realizes the 
reduction of the power consumption. In the next time, 
the calculation of the total sum of the difference abso- 
lute values is performed with regard to each of the 
remaining blocks (for example, block Y 507 shown in 
Fig. 5) within the search area 504. Then, the displace- 
ment between the block having the smallest total sum 
and the previous picture block is detected as a motion 
vector. 

When the total sum calculation is halted based on 
the overflow signal 215, the evaluation value outputted 
from the PE at the final stage becomes indefinite value. 
However, this indefinite evaluation value can be pre- 
vented from being mistakenly regarded as the smallest 
evaluation value, by making the comparison means 605 
receive the overflow signal 215 and making the indefi- 
nite value be excluded from the comparison target. 

When the adder 211 has a 10-bit width in the 
present embodiment, the circuit scales of the adder 21 1 
and the pipeline register 214 can be 62.5% of those of 
their equivalents in the conventional motion vector 
detection apparatus. 

The data transition in the pipeline register 21 4 in the 
PE at the current stage is prohibited by the overflow sig- 
nal 215 in the PE at the previous stage in the present 
embodiment. However, by prohibiting the pipeline regis- 
ter 213 in the PE at the current stage from performing 
data transition by means of the overflow signal 215 in 
the PE two stages before, it becomes possible to pro- 
hibit the adder 21 1 from performing useless calculation 
in the PEs after the current stage and also to prohibit the 
pipeline register 213 from performing data transition. 
Furthermore, by prohibiting the transition of a signal 
entered to A terminal of the difference absolute value 
calculator 210 at the current stage by means of the 
overflow signal 215 in the PE three stages before, it 
becomes possible to prohibit the difference absolute 
value calculator 210 from performing useless calcula- 
tion in the PEs after the current stage. As a result, the 
power consumption can be further reduced. 



(EMBODIMENT 3) 

A third embodiment of the present invention will be 
described based on Figs. 4 (a) and 4 (b). 
s Fig. 4 (a) shows the entire structure of the motion 
vector detection apparatus of the present embodiment. 
This structure shows a case where each block has 256 
pixels, and there are cascade-connected PEs 301 
whose number is the same as that of the pixels. A bus R 
w 302 and a bus S 303 transfer the pixel data required for 
the calculation in each of the PEs 301, and a clock line 
304 supplies each PE with an operation clock. A com- 
parison means 305 enters an output value of the PE at 
the final stage. 

is The present embodiment is characterized by the 
structures of the comparison means 305 and the clock 
line 304. Fig. 4 (b) shows the internal structure of the 
comparison means 305. In the comparison means 305, 
a comparator 310 compares output value A of the PE at 
20 the final stage and holding value B stored in the register 
31 1 by subtraction, and when the subtraction results (A 
—6) is a negative value, rewrites the contents stored in 
the register 31 1 to output value A of the PE at the final 
stage. The data writing of the register 311 is controlled 
25 by the control circuit 31 2. The control circuit 31 2 gener- 
ates a write control signal, based on MSB (Most Signif- 
icant Bit) of an output of the comparator 310 and on a 
clock signal supplied from the clock line 304, and out- 
puts the control signal to the register 31 1 
30 The comparison means 305 further includes a sub- 
tracter 31 3 and a setting value register 314 which stores 
a predetermined setting value as the cumulative addi- 
tion value of the difference absolute values obtained in 
the PE at the final stage. The subtracter 313 subtracts 
35 the smaller cumulative addition value which is actually 
obtained in the PE at the final stage and which is stored 
in the register 31 1 , from the setting value stored in the 
setting value register 314. When the actually obtained 
cumulative addition value is smaller than the setting 
40 value, the subtracter 313 outputs the MSB of the sub- 
traction results as a clock halt signal (halt signal). The 
subtracter 313 functions as a signal generation circuit. 

In Fig. 4 (a), a control circuit (control means) 320 is 
provided on the clock line 304. The control circuit 320 
45 receives the clock hah signal from the subtracter 313 
and halts the supply of the clock signal to the clock line 
304. Consequently, when the clock halt signal is gener- 
ated, each of the PEs 301 , and the comparator 3 1 0 and 
the register 311 of the comparison means 305 do not 
so receive a clock signal and halts their operations. Unlike 
in the present embodiment, it is possible to adopt a con- 
struction where the transfer of pixel data to each PE is 
halted by the halt signal of the subtracter 313. The sub- 
tracter 313 and the control circuit 320 compose a 
55 restriction means which halts the entire operations of 
the motion vector detection apparatus when the value 
stored in the register 31 1 (that is. the smaller cumulative 
addition value actually obtained in the PE at the final 
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stage) is smaller than the setting value. 

Consequently, according to the present embodi- 
ment, when the cumulative addition value actually 
obtained in the PE at the final stage is smaller than the 
setting value, the following operations of the entire s 
apparatus are hatted, and a motion vector can be 
detected by using the block having the cumulative addi- 
tion value smaller than the setting value as the block 
having sufficiently high correlation with the pre/ious pic- 
ture block. Thus, in the present embodiment, when a 10 
block having sufficient correlation with the previous pic- 
ture block is obtained, the calculation of the cumulative 
addition value of the difference absolute values in the 
remaining blocks within the search area 504 can be 
halted. As a result the power consumption can be is 
reduced, as compared with the conventional apparatus 
which always has to calculate the cumulative addition 
value of the difference absolute values in all the blocks 
within the search area 504. 

20 

(EMBODIMENT 4) 

A fourth embodiment of the present invention will 
be described based on Figs. 6 and 7. 

Fig. 6 shows the structure of a PE in the motion 25 
vector detection apparatus of the present embodiment. 
The structure includes a difference absolute value cal- 
culator 710, an adder 71 1 , a register 712 which is dis- 
posed at the previous stage of the difference absolute 
value calculator 710 and which receives the pixel data 30 
of the previous picture block shown in Fig. 5. a pipeline 
register (register) 713 which is disposed at the stage fol- 
lowing the difference absolute value calculator 710, a 
pipeline register (register) 714 which is disposed at the 
stage following the adder 71 1 , and a data transfer con- 35 
trol circuit 715 which receives the pixel data of the 
blocks 505 and 507 within the search area 504 shown in 
Fig. 5 via the data bus R. 

In each of the registers 712, 713, and 714, picture 
data are written in the low-order 4-bit storage area and 40 
the high-order 4-bit storage area independently of each 
other by the writing signal CLK-A and the writing signal 
CLK-B, respectively. These two different writing signals: 
CLK-A and CLK-B are generated in accordance with the 
value f0 w or T) of a mode signal (switch signal) set 45 
from outside as shown in Fig. 7. The writing signal CLK- 
A occurs only when the mode signal MODE is "0", while 
the writing signal CLK-B occurs when the mode signal 
MODE is "0" or "1". Consequently, when the mode sig- 
nal MODE is "1 the registers 712-714 can prohibit the so 
writing of data to the low-order 4-bit storage area. The 
low-order 4-bit storage area of each of the registers 
712-714 is reset at "0" before they are started. 

The data transfer control circuit 715 fixes the low- 
order 4 bits at either value (usually at "0") by the mode ss 
signal MODE and transfers picture data for the high- 
order 4 bits on the data bus R to the difference absolute 
value calculator 710. 



The above-mentioned structure composes a 
restriction means 30 which controls the wiring of data to 
the high-order 4 bits and the low-order 4 bits of the reg- 
isters 712, 713, and 714 separately from each other by 
the two different writing signals: CLK-A and CLK-B, and 
also controls the transfer of the high-order 4 bits and the 
low-order 4 bits of the data transfer control circuit 715 
independently of each other by the mode signal MODE. 

In the motion detection which deals with natural pic- 
tures such as the MPEG standard, even if the low-order 
bits are ignored, almost the same level of a precision as 
the case where all the bits are used can be obtained, so 
that the picture quality is hardly affected. When the 
motion detection dealing with natural pictures is per- 
formed in the present embodiment, the operation of the 
low*order 4 bits of each of the registers 71 2-71 4 and the 
data transfer control circuit 715 can be halted, so as to 
reduce the power consumption without losing satisfac- 
tory p;cture quality. 

in each of the registers 712-714 and the data trans- 
fer control circuit 71 5, each bit string is divided into low- 
order 4 bits and high-order 4 bits; however, a more 
minute control can be realized by dividing the bit string 
into 3 or more. 

(EMBODIMENT 5) 

Finally, a fifth embodiment of the present invention 
will be described based on Figs. 8 (a) and 8 (b). Fig. 8 
(a) shows the entire structure of the motion vector 
detection apparatus of the present embodiment. In the 
structure, each block has 256 pixels, and there are cas- 
cade-connected 256 PEs 401 , a bus R 402 and a bus S 
403 for transferring data required for the calculation in 
each of the PEs 401 , a clock line 404 for supplying each 
PE with an operation clock, and a comparison means 
405 for receiving an output of the PE at the final stage. 

The present embodiment is characterized by the 
internal structure of each PE. All the addition means 
(the adder 1 1 1 shown in Fig. 2) in the PEs 401 have dif- 
ferent bit widths from each other: the adder 111 in the 
PE at a later stage has a larger bit width. In other words, 
in each PE, the dynamic range of the data to be trans- 
ferred from the adder 1 1 1 to the PE at the next stage is 
set larger in the PE at a later stage. Since the opera- 
tions proceed normally unless there is an overflow, it is 
possible to modify the calculation precision in each PE. 
The dynamic range D of the adder 1 1 1 at each stage (i) 
when the pixel data have 8 bits can be expressed by the 
following equation: 

D = log 2 (255 (i+1) ) i = 0, 1. 2. • • • . 255 

Consequently, the smallest bit width of the data to 
be transferred to the PE in the following stage is an inte- 
ger to which the value D is raised. The results are 
shown in Fig, 8 (b). 

In the present embodiment, nine different PEs 
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which output data with a precision of 8, 9, 1 0, 1 1 , 1 2, 1 3, 
14, 15, and 16 bits, respectively, are realized by cas- 
cade-connecting them as shown in Fig. 8 (a). 

According to the above-mentioned structure, the 
number of bits in the adder and the pipeline register (the 5 
adder 1 1 1 and the pipeline register 1 14 shown in Fig. 2) 
can be reduced, as compared with the conventional 
structure. As a result, both the circuit scale and the 
power consumption can be reduced. 

In the present embodiment, as apparent from Fig. 8 10 
(b), the circuit scales of the adder 1 1 1 and the pipeline 
register 1 14 in each PE can be about 90% of those of 
their equivalents in the conventional motion vector 
detection apparatus. 

Each of the above-mentioned embodiments can is 
obtain the effects of reducing the circuit scales and the 
power consumption independently, so that any of the 
embodiments can be combined to enhance the effects. 

Claims 20 

1. A motion vector detection apparatus for detecting 
correlation between a picture consisting of a plural- 
ity of pixels and a plurality of blocks each consisting 
of a same number of pixels as said picture, the plu- 25 3. 
rality of blocks being included within a predeter- 
mined search area containing a larger number of 
pixels than said picture, said motion vector detec- 
tion apparatus comprising: 

30 

a plurality of processor elements which are 
cascade-connected, each of said plurality of 
processor elements calculating a difference 
absolute value between each of the plurality of 
pixels included in said picture and a corre- 35 
sponding one of pixels included in every one of 
the plurality of blocks, and performing cumula- 
tive addition of the difference absolute values 
for every one of the plurality of blocks sequen- 
tially; 40 
a comparison means for repeatedly comparing 
cumulative addition values of the difference 
absolute values of two of the plurality of blocks 
by subtraction, the cumulative addition values 
being obtained in a processor element at a final 45 
stage, and for repeatedly selecting a smaller 
cumulative addition value; and 
a restriction means for placing restrictions on 
calculation of at least one of said comparison 
means and said plurality of processor ele- so 
ments, so as to give a cumulative addition 
value finally obtained in said comparison 
means a possibility of becoming a sufficiently 4 
small cumulative addition value even if the 
cumulative addition value is not smallest. 55 

2. The motion vector detection apparatus of claim 1. 
wherein each of said plurality of processor ele- 



ments comprises: 

a difference absolute value calculation means 
for calculating a difference between each of the 
plurality pixels included in said picture and a 
corresponding one of pixels contained in one of 
the plurality of blocks, and further calculating 
an absolute value of the difference, so as to 
obtain a difference absolute value having a pre- 
determined number of bits; 
an addition means for adding the difference 
absolute value obtained by said difference 
absolute value calculation means and an out- 
put of a processor element at a previous stage 
together, and for transferring addition results to 
a processor element at a next stage; and 
said restriction means being composed of a 
processing means for truncating or rounding 
predetermined low-order bits of the difference 
absolute value obtained by said difference 
absolute value calculation means, so as to 
obtain the difference absolute value having 
high-order bits. 

The motion vector detection apparatus of claim 1, 
wherein each of said plurality of processor ele- 
ments comprises: 

a difference absolute value calculation means 
for calculating a difference between each of the 
plurality pixels included in said picture and a 
corresponding one of pixels contained in one of 
the plurality of blocks, and further calculating 
an absolute value of the difference, so as to 
obtain a difference absolute value; 
an addition means for adding the difference 
absolute value obtained by said difference 
absolute value calculation means and an out- 
put of a processor element at a previous stage 
together, and for transferring addition results to 
a processor element at a next stage, said addi- 
tion means being composed of a calculator 
whose bit width is small; and 
said restriction means comprising said calcula- 
tor which composes said addition means and a 
forced halt means for forcibly halting the cumu- 
lative addition of the difference absolute values 
in processor elements at following stages when 
a value calculated by said calculator is larger 
than a maximum value which can be expressed 
with said bit width. 

The motion vector detection apparatus of claim 3. 
wherein said calculator whose bit width is small is 
composed of a calculator which outputs an overflow 
signal when a calculated value is larger than the 
maximum value which can be expressed with said 
bit width; and 
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said forcible halt means receives the overflow 
signal from said calculator in a processor ele- 
ment at a previous stage, ignores addition 
results of said addition means, and transfers 
the overflow signal in each of processor ele- 5 
merits at a current stage and a previous stage 
to a processor element at a next stage. 

5. The motion vector detection apparatus of claim 1 . 
wherein said restriction means compares a setting 10 
value with the smaller cumulative addition value 
selected by said comparison means, and halts 
entire operations of said motion vector detection 
apparatus when the smaller cumulative addition 
value is smaller than the setting value. 15 

6. The motion vector detection apparatus of claim 5, 
wherein said restriction means comprises: 

a signal generation means for generating a halt 20 
signal when the smaller cumulative addition 
value is smaller than the setting value; and 
a control means for halting supply of a clock 
signal to each of said plurality of processor ele- 
ments and said comparison means when said 25 
signal generation means generates the halt 
signal. 



7. The motion vector detection apparatus of claim 1 
further comprising: 



30 



a plurality of registers for storing data, said plu- 
rality of registers being provided in each of said 
plurality of processor elements; 
a data transfer control circuit provided on a 35 
data bus for transmitting data of the plurality of 
pixels composing one block to each of said plu- 
rality of processor elements; and 
said restriction means dividing each bit string 
of said plurality of registers and said data trans- 40 
fer control circuit into a plurality of contiguous 
bit strings and controlling data writing or data 
transferring of each of the plurality of contigu- 
ous bit strings independently. 

45 

8. The motion vector detection apparatus of claim 7, 
wherein said restriction means generates a switch 
signal, so as to halt supply of a clock signal to pre- 
determined low-order bits of each bit string of said 
plurality of registers and said data transfer control so 
circuit 



pixels than said picture, said motion vector detec- 
tion apparatus comprising: 

a plurality of processor elements which are 
cascade-connected, each of said plurality of 
processor elements calculating a difference 
absolute value between each of the plurality of 
pixels included in said picture and a corre- 
sponding one of pixels included in every one of 
the plurality of blocks, and performing cumula- 
tive addition of the difference absolute values 
for every one of the plurality of blocks sequen- 
tially, each of sab plurality of processor ele- 
ments comprising: 

a difference absolute value calculation means 
for calculating a difference between each of the 
plurality pixels included in said picture and a 
corresponding one of pixels contained in one of 
the plurality of blocks, and further calculating 
an absolute value of the difference, so as to 
obtain a difference absolute value; 
an addition means for adding the difference 
absolute value obtained by said difference 
absolute value calculation means and an out- 
put of a processor element at a previous stage 
together, and for transferring addition results to 
a processor element at a next stage, and said 
addition means provided in a processor ele- 
ment at a later 

stage of said plurality of processor elements 
having a larger bit width. 

10. The motion vector detection apparatus of claim 9, 
wherein said addition means provided in each of 
said plurality of processor elements has a smallest 
bit width that does not cause an overflow. 



9. A .motion, vector detection apparatus for detecting 
correlation between a picture consisting of a plural- 
ity of pixels and a plurality of blocks each consisting 55 
of a same number of pixels as said picture, the plu- 
rality of blocks being included within a predeter- 
mined search area containing a larger number of 
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(54) Motion vector detection apparatus 

(57) In a motion vector detection apparatus, cas- 
cade-connected processor elements each calculates 
the absolute value of the difference between each of a 
plurality of pixels which compose a picture and a corre- 
sponding one of the same number of pixels included in 
a block, and also performs cumulative addition of the 
difference absolute values in the Hock. These opera- 
tions are performed for each of the blocks within the pre- 
determined search area. A comparative means 
repeatedly compares the cumulative addition values of 
two blocks obtained sequentially in the processor ele- 
ment at the final stage, and selects the smaller one of 
the cumulative addition vaJues. A subtracter compares 
the smaller cumulative addition value with a setting 
value. When the smaller cumulative addition value is 
smaller than the setting value, a control circuit halts the 
supply of clock signals to the processor elements and 
the comparison means, so as to halt the entire, opera- 
tions of the apparatus. Consequently, the power con- 
sumption can be reduced. 
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(54) Motion vector detection apparatus 

[57] In a motion vector detection apparatus, cas- 
cade-connected processor elements each calculates 
the absolute value of the difference between each of a 
plurality of pixels which compose a picture and a corre- 
sponding one of the same number of pixels included in 
a block, and also performs cumulative addition of the 
difference absolute values in the block. These opera- 
tions are performed for each of the blocks within the pre- 
determined search area. A comparative means 
repeatedly compares the cumulative addition values of 
two blocks obtained sequentially in the processor ele- 



ment at the final stage, and selects the smaller one of 
the cumulative addition values. A subtracter compares 
the smaller cumulative addition value with a setting 
value. When the smaller cumulative addition value is 
smaller than the setting value, a control circuit halts the 
supply of clock signals to the processor elements and 
the comparison means, so as to halt the entire opera- 
tions of the apparatus. Consequently, the power con- 
sumption can be reduced. 
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